
Europäisches Patentamt
European Patent Office
Office européen des brevets






Publication number: 0 495 622 A2

EUROPEAN PATENT APPLICATION

Application number: 92300302.4
Date of filing: 14.01.92

Int. Cl.: G06F 15/401, G06F 15/403

Priority: 14.01.91 GB 9100733

Date of publication of application:
22.07.92 Bulletin 92/30

Designated Contracting States:
DE FR GB

Applicant: XEROX CORPORATION
Xerox Square
Rochester New York 14644 (US)

Inventor: Lamming, Michael G.
179 Hills Road
Cambridge CB2 2RN (GB)
Inventor: Newman, William Maxwell
Yew Tree Cottage, Farnham
Blandford Forum, Dorset (GB)

Representative: Goode, Ian Roy et al
Rank Xerox Patent Department Albion House,
55 New Oxford Street
London WC1A 1BS (GB)

Indexing of data sets.



A system is disclosed for facilitating
access to a portion of a datastream for retrieval
purposes based upon identifying "episodes" in
data from other sources related to a user's
activities. The stored data, in raw or "episodic"
form, are accessed as desired using parameters
based on recollections of the user's or database
interrogator's activities at the time of the event
which the user or interrogator wishes to access
or display. Such other sources may include
information about the movements, gatherings,
conversations, phone calls, etc. of users in a
particular environment fitted with the system.



[Figure: Fig. 3 (labels: 26, 26, 28, 30, 38, BADGE DATABASE, BADGE EPISODE RECOGNISER, QUERY 10)]

Jouve, 18, rue Saint-Denis, 75001 PARIS 






This invention relates to the activity-based indexing of
data sets, in order to facilitate the fast retrieval at a
later date of data of interest. The invention is based on
there being at least one set of data defining activity. The
retrieval process involves identifying an index point in a
data set. Where there are multiple data sets, having index
points in one of them facilitates access to the other set to
reach the data in it at the point in time, or period,
contemporaneous with the identified index point.

The invention focuses upon improving access to information
that humans find particularly difficult to retrieve. It
exploits a suggested autobiographical model of human memory
which observes that humans seem to organise their memory in
episodes. Associated with each episode are the properties:
Where? When? With whom? and, In pursuit of what goal? an
episode occurred. Humans seem able to recall these key
properties of an episode long after the details of the
episode itself have been forgotten. The present invention is
concerned with building a system that enables retrieval of
episodes that would otherwise be difficult or impossible to
retrieve, including matters for which records are not
normally kept. This system has been called a 'Human memory
prosthesis'.

The present invention relies on a variety of apparatuses for
monitoring, recording and time-stamping various key aspects
of human activity, from which a readable list of episodes,
or 'activity diary', can be generated automatically. This
diary, which correlates quite closely with human
recollections of past episodes, can be used simply to
stimulate further recall, or to identify an episode and its
time of occurrence, or to show a set of episodes from
different sets. The times may be used to locate the
corresponding items in any other set of time-stamped data,
such as a sequence of video frames or an audio record of a
meeting.
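
As an illustration only, the use of an episode's time to locate the corresponding item in another time-stamped data set can be sketched as a binary search over sorted time-stamps. The function name `locate_by_time` and the sample frame times are assumptions made for this sketch and are not part of the specification.

```python
from bisect import bisect_right

def locate_by_time(timestamps, t):
    """Return the index of the last item recorded at or before time t.

    `timestamps` must be sorted in ascending order. Returns None when
    t precedes the first item.
    """
    i = bisect_right(timestamps, t)
    return i - 1 if i > 0 else None

# Hypothetical video frames, stamped in seconds from the start of recording.
frame_times = [0.0, 0.5, 1.0, 1.5, 2.0]
episode_start = 1.2  # time-stamp read from the activity diary
frame_index = locate_by_time(frame_times, episode_start)
```

Playback of the video or audio record would then begin at the returned index.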

In what follows, an 'episode' is defined as a period of time
containing a memorable sequence of events. By dividing a
long series of events into such episodes, sequences are
formed that are particularly memorable. Episodes have flags
associated with them that characterise them and can be used
for retrieval.

One form of such apparatus was disclosed in an article in
The Sunday Times, 11 February 1990, entitled "Computers
acquire social skills", by Jane Bird. This mentioned the use
of identity buttons to enable the computer to know who was
where in the building at all times. Associated with the
system is a network of microphones and/or video cameras
capable of recording conversations. It gave the example of a
copier in a room monitored by the computer. Upon the
copier's developing a fault, a frustrated would-be user
tries to recall what a colleague told him about how to cure
the fault. The computer is interrogated to find the last
time that the two people concerned were at the copier. Once
this particular bit of information had been recovered, it
was used to access the time-stamped audio records of what
was said in the room containing the copier at that point of
time. In this way, the enquirer had the opportunity of
hearing again the conversation between him and his
colleague. A system of the foregoing type also is described
in a copending U.S. Patent Application of the instant
inventors, which was filed 7 February 1991 under Serial No.
07/652,159 on "Indexing of Audio/Visual Data".

The aforementioned system has several practical limitations.
These include the need on the part of the user to devote a
lot of effort to identifying a particular time and place
from the raw identity button data, and the lack of precise
episode definition from such data. Also the system captures
just the location of users, and not their activity or other
significant episodes. These drawbacks would make the system
of relatively little practical value.

Another relevant publication in this area of technology is
the article entitled "Reflexive CSCW: supporting long-term
personal work" by H Thimbleby, S Anderson and I Witten,
Interacting with Computers, volume 2, No. 3, December 1990,
pages 330-336. The term "CSCW" means computer-supported
cooperative work. The section under the heading
"Background", on pages 331/2 of the article, refers to a
method whereby a research worker is able to be given on
request a record of workstation activities he was
undertaking over a chosen period. The article suggests that
this approach can generate useful information for retrieval
purposes, but offers no practical means of doing so. Like
the first mentioned system, it fails to recognise the need
for analysis of the raw data to produce user-recognisable
episodes.

The present invention is based on devising pattern-matching
techniques for constructing recognisable episodes from raw
data. In this way it offers the user the means to arrive
quickly at a particular point in one dataset, possibly in
order to be able to obtain access more efficiently and
quickly into at least one other dataset. This speed of
access is one of the essential requirements in making the
technique of practical value.

The invention also takes into account situations where there
may be relatively infrequent change of location by users,
i.e. where location data would be of little use in
identifying episodes with precision. In these situations,
other forms of activity are likely to provide a more
effective means of episode identification. The invention
therefore presents means of instrumenting these activities
and analysing the raw data, thus constructing recognisable
episodes from these multiple sources of activity data. The
invention also observes that retrieval often involves
correlating activities of several different kinds, and
therefore provides means for doing so.




The present invention will now be described by 
way of example with reference to the accompanying 
drawings, in which: 

Fig. 1 is a diagrammatic view of a basic data-retrieval
system of the present invention, in which the raw
activity-based data are broken down into episodes which
may be selectively fed to various output devices;
Fig. 2 is a view of a more-complex system than Fig. 1,
using two or more of the basic systems of Fig. 1 feeding
into a compound episode recogniser so as to be able to
establish relationships between different forms of input
data;
Fig. 3 is a diagrammatic view of one system of the present
invention using "active badges" to feed movement
information into the system;
Fig. 4 is a diagrammatic view of a sub-system for using
badge episode data produced by the system of Fig. 3 to
identify when two or more people form a gathering, and for
forming a travelogue of the movements of people carrying
badges between different zones in the environment being
monitored;
Fig. 5 shows in (a) a diary of individual events in the
life of one person, and in (b) these events simplified by
amalgamating short periods alone into attendance at a
gathering;
Fig. 6 is a diagrammatic view of part of a system of the
present invention in which the activity being monitored is
the presence of various documents on the user's workspace,
with at least descriptors of those documents being stored
to complement the other data assisting in later retrieval;
Figs. 7(a) to (c) are partial displays of a typical output
of the Fig. 6 system;
Fig. 8 is a diagrammatic view of a sub-system based on
that system shown in Fig. 6, to break down the presence of
documents on the workspace, and the timing of the
appearance on, and disappearance from, the workspace, to
produce document-movement episodes;
Fig. 9 is a diagrammatic view of a sub-system of the
present invention in which the sounds received and emitted
by a person wearing a microphone are subjected to a
filtering action to determine when the person carrying the
microphone is talking, from when he is not. By the use of
a compound episode recogniser as shown in Fig. 2,
conversations between two or more people can be recognised
as such and can be used to form episodes for retrieval
purposes;
Figs. 10(a) and (b) are views of typical outputs from the
Fig. 9 system, showing how conversations are distinguished
from concurrent talk;
Fig. 11 is a diagrammatic view of a sub-system rather
similar to that of Fig. 8, in which the input is of
telephone calls made or received by an individual in the
environment being monitored. By means of signals derived
from incoming calls, or from utterances of the person
being monitored, such phone calls can be formed into
episodes, with any resultant diary including the names of
the people with whom the person conversed over the
telephone, irrespective of whether or not that person
identified himself by name, or was identified by name by
the person being monitored; and
Fig. 12 is a diagrammatic view of a sub-system in which
the activity being monitored is that of the use of the
electronic workstation of the person being monitored.

In the drawings, many parts are specifically labelled as
well as being given reference numbers. When components or
units are common to two or more of the Figures, they retain
their original references.

The data-retrieval system of this invention is intended to
operate on activity-based data from a choice of sources.

As shown in Fig. 1, the data may be fed directly to a memory
4, prior to processing by the episode recogniser 6, or the
recogniser might operate on raw data and store the processed
data in a memory 8. Because the information to be retrieved
is not known at the time of data capture, it is necessary to
be able to specify the appropriate characteristics of the
data to be retrieved by means of a query station 10, having
a keyboard 12 which the user can use to alter the parameters
of the recogniser 6 so that the raw data are processed into
episodes having predetermined characteristics.

The output of the recogniser 6 is fed into a diary composer
14 that is adapted to process the episode data fed to it and
produce output data in the form of diary information which
is fed to a driver 16. Although the driver is shown as
operating a printer 18, a visual display unit 20, and a
loudspeaker or other audio device 22, not all of these
devices need be used in any specific embodiment. There
should, however, always be at least one output device able
to present the data or information to the user, and to
enable him to identify a portion of the data which he
associates with a particular episode he is trying to recall,
so that the time-stamp of that episode then can be used to
access video or audio recordings of the environment in which
he was interested at the time of the episode in question.

Although no specific time-keeping device is shown in the
Fig. 1 embodiment, it is to be understood that all forms of
apparatus according to this invention operate only on
time-stamped data. Most microprocessors which form
components of the system of the present invention have
built-in time clocks, whereby the microprocessor output is
time-stamped, even if the user is not always able to see the
time-stamp on first viewing, but perhaps only on demand. The
drawings have therefore omitted the clock input.
That apparatus shown in Fig. 2 uses two different sources 2
of raw input data with two appropriately configured episode
recognisers 6. Each episode recogniser feeds its output to a
compound episode recogniser 24, of which the
episode-recognition parameters are set by a query device 10,
with the queries being passed as necessary upstream to each
episode recogniser 6 feeding its output to the compound
recogniser 24. This involvement of the individual
recognisers with the compound recogniser is necessary in
order to enable the latter to sort data into compound
episodes involving data of two different types combined in a
way specified by the query station 10. In a manner similar
to the sub-system shown in Figure 1, the compound recogniser
24 feeds its output to diary composer 14 and an output
device driver 16 connected to selected output devices (not
shown).

As will be appreciated, the diary composer 14 is
application-dependent, whereas the episode recogniser 6 is
data-dependent. The compound recogniser accepts both simple
queries, as in the Fig. 1 system, and compound queries
specifying relationships between episodes from different
sources.

The system shown in Fig. 3 is based on the use of encoded
identifiers, each intended to be carried by people working
in the environment being monitored. One
particularly-convenient form of identifier is the "active
badge" offered by Olivetti. This identifier takes the form
of miniaturised circuitry in a housing able to be pinned or
clipped to the clothing of the person to whom the identifier
is issued. The circuitry is designed to emit pulse width
modulated infra-red coded signals for a tenth of a second
every 12 to 15 seconds. The signals, by their nature, have a
range of about six metres, and will not travel through
walls, which makes them very useful for keeping track of
people moving between rooms in a normal working environment.
In each of the rooms or corridors of interest (which would
normally exclude toilets, lifts and like utility spaces)
would be positioned one or more sensors responsive to the
infra-red output signals. The sensors would be connected to
a master station processor, which would have the task of
polling the sensors for identification 'sightings',
processing the data (which would include decoding the
signals to determine which person was within range of a
specific sensor at the time of polling), and presenting the
processed data in a visual format.

The badge system shown in Fig. 3 uses a network of sensors
26 of which there is at least one in each zone of the
building or other environment being monitored. In large
rooms, the effective range of the badges might necessitate
there being several sensors uniformly distributed over the
room, as by being mounted from the ceiling, to ensure that
each occupant who is carrying a badge is in range of at
least one sensor, so that the room has no 'blind spots'. The
sensors 26 are connected to a polling device 28 which sends
pulses to the sensors at chosen intervals, and receives from
the sensors encoded signals indicating the badges within
range of each sensor at the time of polling. The poller 28
supplies this badge information to a badge server 30 having
its own data memory 32. The badge server 30 supplies its
signals to a badge database 36, having a data memory 38, so
that the badge signals are decoded and the identity of the
person allocated a badge becomes part of the badge signal.
The badge server 30 accepts the poller data and maps sensor
signals into people and locations using its own database 32,
and maps the detection of a particular person in a
particular location at a specific time into arrival and
departure events. The database 36 stores badge event data if
appropriate. A badge episode recogniser 40 operates on the
output from the database 36 to characterise the badge
signals into three types of episodes: gatherings, periods
alone, and travel between or away from monitored locations.
Similarly to what has been already described, the recogniser
40 has its output signals treated by diary composer 14 and
driver 16.
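
The mapping of sightings into arrival and departure events described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the badge server's actual logic: the tuple layout, the event names "arrive" and "depart", and the rule that a person stays where last sighted until sighted elsewhere are all assumptions of this sketch.

```python
def sightings_to_events(sightings):
    """Turn (time, person, location) sightings into arrival/departure events.

    Assumption: a person remains where last sighted until sighted elsewhere.
    """
    last_seen = {}  # person -> location of the most recent sighting
    events = []
    for time, person, location in sorted(sightings):
        previous = last_seen.get(person)
        if previous == location:
            continue  # repeated sighting in the same place: no new event
        if previous is not None:
            events.append((time, person, "depart", previous))
        events.append((time, person, "arrive", location))
        last_seen[person] = location
    return events

# Hypothetical poll results: (seconds, badge holder, zone).
sightings = [
    (0, "KBW", "office-1"),
    (15, "KBW", "office-1"),
    (30, "KBW", "commons"),
    (30, "MGL", "commons"),
]
events = sightings_to_events(sightings)
```

A fuller version would presumably also time out a person who has not been sighted anywhere for several polling intervals; that case is omitted here.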

The sub-system of Fig. 4 details the way in which the badge
event signals may be processed. The first stage of the badge
episode recogniser 40 of Fig. 3 is a gathering recogniser
42, which receives badge event signals, uses the events to
keep a record of who is at each location, and notes the
formation and dispersion of "gatherings". This term is a
convenient one to use, as signifying an episode worthy of
being recorded. A working definition of a "gathering" is
that it is regarded as an occasion which has started when
two or more people are present concurrently at a specified
location, for more than a specified minimum period, and at
least one of the participants is not resident at that
location. This implies that the location database forming
part of the environment identifies which is the office of
each badge holder, so that his presence in his office is
recognisably different from the presence of someone else in
his office. The gatherings data are stored for use by a
travel organiser 44, which receives badge events and
supplies signals to a stopover list 46, the stopovers
including locations where more than a minimum period was
spent, locations at the terminus of a round trip of more
than a minimum distance, and the locations of gatherings.
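
The working definition of a "gathering" given above can be sketched for a single location as a sweep over arrivals and departures. This is an illustrative sketch only: the function name `find_gatherings`, the `(person, start, end)` input layout and the sample values are assumptions, and the gathering recogniser 42 may be organised quite differently.

```python
def find_gatherings(stays, residents, min_duration):
    """Find gatherings at one location.

    stays: list of (person, start, end) periods of presence.
    residents: set of people whose own office this location is.
    A gathering needs two or more people present for longer than
    min_duration, at least one of them not a resident.
    """
    changes = []
    for person, start, end in stays:
        changes.append((start, 1, person))  # arrival
        changes.append((end, 0, person))    # departure
    changes.sort()  # at equal times, departures sort before arrivals

    present, participants, began = set(), set(), None
    gatherings = []
    for time, is_arrival, person in changes:
        if is_arrival:
            present.add(person)
            if len(present) >= 2 and began is None:
                began = time
                participants = set(present)
            elif began is not None:
                participants.add(person)
        else:
            present.discard(person)
            if began is not None and len(present) < 2:
                long_enough = time - began > min_duration
                has_visitor = bool(participants - residents)
                if long_enough and has_visitor:
                    gatherings.append((began, time, frozenset(participants)))
                began = None
    return gatherings

# Hypothetical stays in MGL's office, times in minutes.
stays = [("MGL", 0, 100), ("KBW", 10, 60), ("WMN", 20, 25)]
gatherings = find_gatherings(stays, residents={"MGL"}, min_duration=30)
```

Here the gathering runs from minute 10, when a second person arrives, to minute 60, when the occupancy falls back to one.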

Data from a floor-plan store 48 are used to recognise "round
trips", by which term is meant when a person being monitored
leaves his office and moves around the environment before
returning to it. The stopover list 46 incorporates details
of gatherings that are fed to the diary composer 14. The
diary composer 14 operates as illustrated in Fig. 5. The raw
data showing a small part of the movements of one individual
are shown in Fig. 5 (a). This relatively-raw data would lead
to an unnecessarily-long diary or event log. Thus the diary
composer 14 operates on this information to combine
sequences of consecutive episodes that match specific
patterns. Thus, for the activities of the person shown in
Fig. 5 (a), the periods when he is alone in the conference
room are treated as being part of the general gathering in
that room, thus leading to the simplification shown in Fig.
5 (b).
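
The simplification from Fig. 5 (a) to Fig. 5 (b) can be sketched as a single pass that absorbs periods alone into an adjoining gathering in the same room. The episode layout, the kind labels "alone" and "gathering", and the sample diary are assumptions made for this sketch; the patterns actually used by the diary composer 14 are not given in the text.

```python
def simplify_diary(episodes):
    """Merge consecutive episodes in one room into a single gathering.

    episodes: time-ordered list of (start, end, kind, location), where
    kind is "alone" or "gathering" (labels assumed for this sketch).
    """
    merged = []
    for start, end, kind, location in episodes:
        if merged:
            p_start, p_end, p_kind, p_location = merged[-1]
            same_place = p_location == location
            involves_gathering = "gathering" in (p_kind, kind)
            if same_place and involves_gathering:
                # Absorb this episode into the gathering in the same room.
                merged[-1] = (p_start, end, "gathering", location)
                continue
        merged.append((start, end, kind, location))
    return merged

# Hypothetical raw diary, times in minutes since midnight.
raw = [
    (540, 545, "alone", "conference room"),
    (545, 570, "gathering", "conference room"),
    (570, 572, "alone", "conference room"),
    (572, 600, "gathering", "conference room"),
    (600, 630, "alone", "office"),
]
diary = simplify_diary(raw)
```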

Fig. 6 shows the system which exploits scanning and
image-processing technology to read the contents of
documents on the desk surface; the user is then able to
interact with these contents using a pointer such as a
tablet and stylus. A wide range of interaction techniques
may be used, including menu selection and dialogue boxes,
graphical selection and hand-printed character recognition.
Moreover, the user is also free simply to write on documents
with a pen or with an ink pen in the stylus. Particular
tasks for which the system could be effective include:
selecting key words for document filing; performing
calculations on numeric data; translating foreign words;
looking up technical terms in dictionaries; checking and
filling in forms; and retrieving other documents or
information that cannot be printed, e.g. audio or video. The
invention could therefore give desk workers and people
attending meetings access to functions that are currently
available only via workstations. The apparatus shown in Fig.
6 includes a video camera 50 and a video projector 52
mounted above a desk 54 at which the user sits, a data
tablet 56 with a stylus 58, a high-resolution scanner 60,
and a control computer 62 with its data store or memory 64
for images.

The camera 50 is mounted so that its field of view includes
the document on the desk 54 in front of the user. The video
signal is digitised using a frame store, creating a bit map
image of the document. Digital filtering techniques are
applied to the image, first to determine size and rotation,
and then to classify the document according to properties
such as line spacing, line-length distribution, etc.
Application of these techniques provides (a) a compact
resolution-independent encoding of the image's properties
for use as a descriptor, and (b) orientation and position
information. The data tablet 56 and stylus 58 permit the
user to select items such as words or columns of numbers.
When the user makes a selection, as by pressing down on the
tablet surface, the coordinates of the stylus 58 are
transformed, using the inverse of the rotation and position
information, to determine the selection in the coordinates
of the page.
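
The inverse transformation of stylus coordinates can be sketched as below. The sketch assumes, purely for illustration, that the page's placement on the tablet is described by a counter-clockwise rotation through `angle` radians followed by a shift of `(offset_x, offset_y)`, with no scaling; the specification does not state the parameterisation actually used.

```python
import math

def tablet_to_page(x, y, angle, offset_x, offset_y):
    """Map a stylus position on the tablet into page coordinates.

    Assumes the page was placed by rotating it through `angle` radians
    (counter-clockwise) and then shifting it by (offset_x, offset_y).
    """
    dx, dy = x - offset_x, y - offset_y        # undo the shift
    cos_a, sin_a = math.cos(angle), math.sin(angle)
    page_x = cos_a * dx + sin_a * dy           # undo the rotation
    page_y = -sin_a * dx + cos_a * dy
    return page_x, page_y

# Hypothetical case: page rotated a quarter turn and shifted by (10, 5).
# The page point (1, 0) then lies at tablet position (10, 6).
page_x, page_y = tablet_to_page(10, 6, math.pi / 2, 10, 5)
```

The recovered page coordinates can then be compared with the stored page coordinates of each text item to find the item selected.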

If the image scanned in from the camera 50 is of
insufficient resolution to support interaction, the document
can be scanned in at high resolution. This operation is done
via the scanner 60, and it could be performed on the entire
contents of the user's in-tray before a work session, or on
an entire set of meeting papers before a meeting starts. The
same digital filtering techniques as before are applied to
the high-resolution page image to construct a descriptor and
to correct for rotation. Optical character recognition (OCR)
is applied to extract the text of the document. This text is
stored for later use, indexed by the document descriptor.
The page coordinates of each text item are included with the
stored text.

The memory 64 contains layout information for each document
page entered into the system, indexed by descriptor. The
user need not apply high-resolution scanning or OCR to a
document page that is already stored. As soon as the
descriptor has been generated from the video scanned data,
the existence of a stored version can be determined. Layout
information describing computer-generated documents can be
written directly to the memory 64, and descriptors
generated, without recourse to scanning or OCR.

The descriptor of each page of a multi-page document, or of
each single document, is non-unique, but is designed to
ensure that descriptors are rarely duplicated. Descriptors
for pairs of document images may be compared using a
correlator to determine if the two images are of the same
document. When the user selects an item with the stylus, the
corrected coordinates are used to identify the text item in
the stored version. The appropriate function (translation,
indexing, etc.) can then be applied to the item. The system
can also apply hand-printing recognition to the stylus
input, or record changes to the scanned image resulting from
conventional handwriting.

Feedback to the user is provided via the video projector 52.
Examples of such feedback are shown in Fig. 7. Results of
translations (Fig. 7a) or calculations (Fig. 7b) would
normally be shown alongside the text. If space permits (and
the system can determine this) the results could be
displayed directly above the selected item. The video
projector 52 also displays menus and dialogue boxes (see
Fig. 7c) for interactive control.

Each time the user places a new document on the tablet, or
turns the pages of a multi-page document, the system can
detect the change in the video-scanned image and then
recompute the image's descriptor. It searches for a matching
descriptor in its file of documents; if it finds a match, it
retrieves the document. If, on the other hand, no match is
found, it signals to the user to feed the document into the
scanner 60.

This embodiment of the invention can be compared either with
workstation-based interaction or with traditional
paper-based work practice. As an alternative to
workstation-based interaction, it has the advantage that it
can deal with any incoming paper document, or print-out of
an electronic document. Compared with paper-based methods,
it has the advantage of providing access to a number of
computer-based functions and services. It has particular
advantages in meeting rooms, where use of paper documents is
a firmly-entrenched practice.

The "audit trail" left by the system provides a source of
data about the user's interaction with paper documents.
These data can be used to construct episodes in which paper
documents are identified either by selected key words or by
optically-read contents. The subsystem shown in Fig. 8 uses
the data from the computer 62 and memory 64 of the Fig. 6
embodiment. The computer feeds raw document events to a
document episode recogniser 66, the events including
descriptors, commands and selected data. The recogniser
builds episodes from consecutive events applied to the same
document; episode descriptions include time-stamp,
descriptor and type of episode (including reading, editing,
filing, calculating). The diary composer 14 takes episodes
describing individual document pages and builds episodes
describing activity on multi-page documents and on folders
of documents.
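
Building episodes from consecutive events applied to the same document can be sketched with a grouping pass over the event stream. The command names and the `COMMAND_TO_TYPE` mapping below are hypothetical, chosen only to make the sketch runnable; the text does not list the commands handled by the document episode recogniser 66.

```python
from itertools import groupby

# Hypothetical mapping from commands to episode types; commands not
# listed here are treated as plain reading.
COMMAND_TO_TYPE = {
    "select_keyword": "filing",
    "sum_column": "calculating",
    "annotate": "editing",
}

def document_episodes(events):
    """Group consecutive events on the same document into episodes.

    events: time-ordered list of (time, descriptor, command).
    Returns a list of (time-stamp, descriptor, episode type).
    """
    episodes = []
    for descriptor, run in groupby(events, key=lambda e: e[1]):
        run = list(run)
        types = [COMMAND_TO_TYPE.get(cmd, "reading") for _, _, cmd in run]
        # The first non-reading activity, if any, names the episode.
        kind = next((t for t in types if t != "reading"), "reading")
        episodes.append((run[0][0], descriptor, kind))
    return episodes

events = [
    (0, "doc-A", "view"),
    (5, "doc-A", "sum_column"),
    (9, "doc-B", "view"),
    (12, "doc-A", "view"),
]
episodes = document_episodes(events)
```

Returning to a document after looking at another starts a new episode, since only consecutive events are grouped.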

While badges and other location sensors are able to detect
that people are in the same room, they cannot tell whether
these people form a group. Fig. 9 shows a sub-system having
the ability to detect conversations between people, without
necessarily recording their voices, so as to provide a much
more accurate indicator of group activity. Users typically
remember conversations better than they remember other
participants' presence, so the ability to detect that the
conversation took place is likely to be useful for
retrieval. In the Fig. 9 system, a microphone 68 is carried
by a user, and the level of the audio signal produced by the
microphone is monitored by means of a circuit of the type
used in cassette recorders. The circuit controller 70
transmits a signal whenever the energy level of the audio
signal exceeds a threshold representing the wearer's minimum
voice level. These signals, and their time of receipt, are
collected by the receiver 72 and passed to a conversation
recogniser 74.

All of the energy data sets from a particular location are
analysed together. The analysis seeks to identify periods of
high energy as utterances, and to fit together utterances of
two or more users so as to form continuous conversations.
The method draws on social science research results showing
that conversation involves relatively-little overlap and
relatively-short pauses between turns. Thus the composite
audio stream is more or less continuous, with mostly only
one speaker active. Fig. 10 (a) shows the audiostreams
generated by participants in a typical conversation, while
Fig. 10 (b) shows, in contrast, the audiostreams of two
people engaged in conversations with others or in dictating,
etc. Each composite signal can be divided into periods of
silence, non-overlapping speech, and overlapping speech. In
true conversation between two people, overlap would tend to
represent less than 10 percent of all non-silence, while in
random collections of speakers, the proportion will be
considerably higher, at least 25 percent.
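
The overlap measure described above can be sketched as the share of non-silent time slots in which both wearers are talking. The slot-based representation (one 0/1 sample per speaker per slot) and the sample streams are assumptions of this sketch; only the 10 percent figure comes from the text, and the requirement for short pauses between turns is not modelled.

```python
def overlap_fraction(a, b):
    """Fraction of non-silent slots in which both speakers are active.

    a, b: equal-length sequences of 0/1 flags, one per time slot,
    set when the wearer's audio energy exceeded the threshold.
    """
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return both / either if either else 0.0

def looks_like_conversation(a, b, max_overlap=0.10):
    """Apply the 'less than 10 percent overlap' rule of thumb."""
    return overlap_fraction(a, b) < max_overlap

# Hypothetical streams: turn-taking with one slot of overlap ...
turn_a = [1, 1, 1, 1, 0, 0, 0, 1, 1, 1, 0, 0]
turn_b = [0, 0, 0, 1, 1, 1, 1, 0, 0, 0, 1, 1]
# ... and two people talking independently of each other.
solo_a = [1, 1, 0, 1, 1, 0, 1, 1]
solo_b = [1, 0, 1, 1, 0, 1, 1, 1]
```

With these samples the first pair overlaps in 1 of 12 non-silent slots and the second in 4 of 8, so only the first pair is treated as a conversation.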

Although this has not been shown in the drawings, an
alternative method of capturing the conversation data is by
radio telephony. Audio signals from users' microphones are
collected at the central location, their energy levels are
measured, and the conversation-detection analysis carried
out. This is a more-expensive approach that offers increased
bandwidth, but it may encounter channel limitations.

The diary composer 14 reduces the list of 
episodes by excluding short episodes, and by linking 
conversations between identical groups. 

Fig. 11 shows a sub-system that builds a log of incoming and
outgoing telephone calls that includes the names of
respondents. The proposed device could be attached to a
user's telephone and would be capable of building a text
database, each record indicating the time and duration of a
call, and both the caller's and the recipient's names, with
one of the names being generated by a speech-recognition
system. The system shown in Fig. 11 uses input from a
telephone 76 from which signals are picked off and fed to a
speech recogniser 78 that has its output routed to a
telephone call episode recogniser 80, with its output, in
turn, going to the now-usual composer 14. Output from the
composer 14 builds the log, of which a typical set of
entries is shown at 84.

This aspect of the invention relies on recognising names
spoken by the user in responding to incoming calls and in
making outgoing calls.

On making outgoing calls the speech recogniser 78, which is
monitoring the audio signal continuously, will recognise an
utterance of the form "Patrick Johnson please" or "Could I
speak to Patrick", etc. As long as the person being called
has corresponding voice identification data entered in a
database 82, the utterance by the caller of the spoken name
produces the audio signals which are fed to recogniser 80.
The recogniser 80 generates the recipient's name as a text
string. On receiving incoming calls, the speech recogniser
80 further responds to a recipient's utterance of the form
"Hello Patrick" or "Hello Mr Johnson" and generates the
caller's name as a text string. This recogniser-generated
data is stored, so it may be used for subsequent manual
confirmation by the user.

The diary composer 14 may combine consecutive calls to the
same person, and ignore wrong numbers and interception by
switchboards, etc. If the caller is not known to the user,
or if the user calls somebody whose data is not in database
82, the system cannot supply the identification data. There
is also the possibility that, in the case of an incoming
call, the recipient may not respond, such as when speaking
to callers who are relatively strange to him, in the way for
which the recogniser 80 is programmed. This variation in
response, or indeed any utterance in which the confidence of
recognition is low, can be used to mark the caller as
unknown in the log entry.

The sub-system shown in Fig. 12 uses the activity 
of a user's workstation to generate episode data. A 
workstation activity episode recogniser 86 has a data
memory 88. The recogniser has various forms of input
from a workstation, as shown in the drawing. One
input might relate to file activity, another to processor
focus, and a third to program input focus. These
inputs are from daemons or watchdog processes.
These monitor activity such as: the creation, reading,
modifying or deleting of files; the name of a program
consuming processor resource in a given period; or
changes in program input focus (in a window-based
user interface). The recogniser 86 translates
sequences of activity into user-recognisable episodes.
These include: editing a document; reading, answering
and filing mail; developing a program; and retrieving
information from a database. The recogniser 86
applies pattern-matching to recognise common
tasks, e.g. "change input focus to editor -> change
processor focus to editor -> read file -> write file ->
write file -> change input focus" would be recognised
as being the steps involved in editing a document. The
composer 14 combines similar episodes and removes
insignificant episodes.
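
One possible way to realise such pattern-matching is sketched below, purely for illustration. Each monitored event is mapped to a one-letter code and a regular expression is run over the resulting string. The event names, the letter codes, the `EDITING DOCUMENT` label and the function `recognise_editing` are assumptions made for this example and are not taken from the embodiment.

```python
import re
from typing import List, Tuple

# Hypothetical one-letter codes for the daemon events named in the text.
EVENT_CODES = {
    "input_focus:editor": "I",
    "proc_focus:editor": "P",
    "read_file": "R",
    "write_file": "W",
    "input_focus:other": "O",
}

# Focus moves to the editor, a file is read, written one or more
# times, and input focus then changes again.
EDIT_PATTERN = re.compile(r"IPRW+O")


def recognise_editing(events: List[Tuple[int, str]]) -> List[Tuple[int, int, str]]:
    """Return (start, end, label) for each editing episode found.

    `events` is a time-ordered list of (time_stamp, event_name) pairs.
    Unrecognised events map to "?" so that they break a match.
    """
    codes = "".join(EVENT_CODES.get(name, "?") for _, name in events)
    episodes = []
    for match in EDIT_PATTERN.finditer(codes):
        # One code per event, so string indices are also event indices.
        start = events[match.start()][0]
        end = events[match.end() - 1][0]
        episodes.append((start, end, "EDITING DOCUMENT"))
    return episodes
```

Because every event contributes exactly one character, a match position in the code string can be mapped straight back to the time stamps of the first and last event of the episode.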

Users can submit queries to each of the recognisers
described above by means of its respective query
input 10. A query will specify an episode or episodes
of interest, including a pattern of episodes, using key
words and descriptions appropriate to the recogniser,
for example:

- with KBW in Commons last week;
- document read after reading Mid-year Report;
- talking to KBW and MGL;
- calls to MGL yesterday; and
- reading mail yesterday.

In the case of the compound recogniser 24 (Fig. 2)
such a query might involve a pattern of the form:

- sending mail to KBW after meeting with MGL.
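
A compound pattern of this kind amounts to a temporal join over episode records of two types. The following sketch illustrates the idea under stated assumptions: the `Episode` record, its field names, the use of minutes since midnight for time stamps and the helper `after` are all hypothetical and do not describe the actual query interface.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Tuple


@dataclass(frozen=True)
class Episode:
    kind: str                # e.g. "meeting" or "send_mail" (assumed labels)
    start: int               # minutes since midnight (assumed representation)
    end: int
    people: FrozenSet[str]   # initials of the other people involved


def after(log: List[Episode],
          first_kind: str, first_person: str,
          then_kind: str, then_person: str) -> List[Tuple[Episode, Episode]]:
    """Find pairs (a, b) where b starts no earlier than a ends."""
    firsts = [e for e in log
              if e.kind == first_kind and first_person in e.people]
    thens = [e for e in log
             if e.kind == then_kind and then_person in e.people]
    return [(a, b) for a in firsts for b in thens if b.start >= a.end]


# "sending mail to KBW after meeting with MGL"
log = [
    Episode("send_mail", 590, 592, frozenset({"KBW"})),
    Episode("meeting", 600, 660, frozenset({"MGL"})),
    Episode("send_mail", 670, 672, frozenset({"KBW"})),
]
matches = after(log, "meeting", "MGL", "send_mail", "KBW")
```

Here only the mail sent at minute 670 qualifies, because the earlier one precedes the meeting.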

Although the different embodiments of the invention
are implemented in the form of computer code,
which is not included in this specification, it is
nevertheless believed that the above description and
drawings are sufficient to enable a man skilled in the art of
designing software systems to put this invention into
practice.

Claims

1. A system for providing for fast random access to
a sequence of time-stamped data records of
events in a specific environment, comprising:

means for collecting data in order to construct
time-stamped records indicating the locations
of persons in the environment;

means for encoding and storing the
records for later selective retrieval and decoding;

means for specifying a pattern characterising
an episode as defined, and for receiving and
decoding matching records;

means for applying rules to the records to
arrive at records of episodes matching a specified
search pattern, in which each episode record is
made up of periods spent alone by the person
concerned, in travel within the environment, and
in attendance at gatherings, and

means for requesting transformation into
human-readable form of the episode records
corresponding to the specified pattern.

2. A system as claimed in claim 1, in which the
environment is divided into several zones, and
including:

means for monitoring one or more of the
zones to detect the presence within the zone of,
and to identify, a person.

3. A system as claimed in claim 1 or 2, in which at
least some of the people in the environment carry
a personal identifier from which data can be
collected by one or more monitors of the
environment.

4. A system as claimed in claim 3, in which each
identifier is in the form of a transmitter of
identification signals to which each monitor is sensitive.

5. A system for providing for fast random access to
a sequence of time-stamped data records of
events in a specific environment, comprising:

means for collecting data in order to construct
time-stamped records identifying scannable
document pages and indicating operations
on the pages at locations in the environment;

means for encoding and storing the
records for later selective retrieval and decoding;

means for enabling identification of the
document contents from a record.



6. A system as claimed in claim 5, including means
for projecting a display on to the surface of the
document or on to a surface bordering the
document.




7. A system as claimed in claim 5 or 6, including
means for manually selecting from within this
document or for displaying a computer function to
be applied to the document.

8. A system as claimed in any of claims 5 to 7,
including means for manually selecting one or
more items on the identified document to be used
as input to a computer function to be applied to
the document.

9. A system for providing for fast random access to
a sequence of time-stamped data records of
activities in a specific environment, comprising:

means for collecting data in order to construct
time-stamped records of periods of vocal
sounds by people in the environment;

means for operating on those records to
determine when a conversation is taking place
involving one or more of the said people, and for
generating time-stamped records of such
determinations;

means for encoding and storing the
records for later selective retrieval and decoding,
and

means for enabling identification of the
collected conversation data from a record.

10. A system for providing for fast random access to
a sequence of time-stamped data records of
activities in a specific environment, comprising:

means for collecting audio signal and other
data from a person in the environment placing or
receiving a phone call in order to construct
time-stamped records of calls;

means for feeding audio signals from
these records to a speech recogniser capable of
identifying the other participant in the call from the
words uttered by the person in the environment,
and for adding this identification to the records;

means for encoding and storing the
records for later selective retrieval and decoding,
and

means for enabling identification of the call
data from a record.



11. A system for providing for fast random access to
a sequence of time-stamped data records of
activities in a specific environment, comprising:

means for collecting data in order to construct
time-stamped records of the activity of each
workstation associated with each person in the
environment, such as the creation, reading,
modification or deletion of electronic files;

means for encoding and storing the
records for later selective retrieval and decoding;

means for specifying a pattern characterising
an episode, and for receiving and decoding
matching records, and

means for enabling identification of the
collected data from a record.

12. A system as claimed in claims 4 and 9, in which
each transmitter has associated with it a
microphone for monitoring the level of the audio signals
produced by the person carrying the transmitter,
the microphone circuit having a filter for rejecting
all signals of less than a threshold level, and
transmitting data describing periods of vocal
sounds greater than the threshold level.

13. A system as claimed in any preceding claim,
comprising means for producing at least auditory
records of what was said in each of the zones.

14. A system as claimed in any preceding claim,
capable of generating episodes of two or more types,
including means for arriving at episode records
satisfying a specified relationship between
episodes of two or more types.

15. A system as claimed in any preceding claim,
including:

means for specifying a pattern characterising
an episode as defined, and for receiving and
decoding matching records;

means for applying rules to the records to
arrive at records of episodes matching a specified
search pattern, and

means for requesting transformation into
human-readable form of the episode records
corresponding to the specified pattern.



[Fig. 1: block diagram. Labelled blocks: RAW DATA; EPISODE RECOGNISER; DIARY COMPOSER; OUTPUT DEVICE DRIVER.]

[Fig. 2: block diagram. Labelled block: COMPOUND EPISODE RECOGNISER (24).]



[Fig. 7, including panels 7(a) and 7(c): document views. Recoverable content: a passage of French text ("correspondant aux [outils] qui se developpent le plus souvent autour de la") with the bracketed word glossed as "tools"; a menu listing INDEX, CALCULATE, TRANSLATE, DICTIONARY, FILL IN; and a column of numeric values.]



[Fig. 3: block diagram. Labelled blocks: POLLER; BADGE SERVER; BADGE DATABASE; BADGE EPISODE RECOGNISER; QUERY input (10).]

[Fig. 4: block diagram. Labelled blocks: GATHERING RECOGNISER; TRAVEL STOPOVERS RECOGNISER; FLOOR PLAN.]

Fig. 5(a).

11:15-12:01 ALONE IN OFFICE
12:01-12:02 ALONE IN CONFERENCE Rm
12:02-13:15 GATHERING IN CONFERENCE Rm
13:15-13:16 ALONE IN CONFERENCE Rm
13:16-14:30 ALONE IN OFFICE

Fig. 5(b).

11:15-12:01 ALONE IN OFFICE
12:01-13:16 GATHERING IN CONFERENCE Rm
13:16-14:30 ALONE IN OFFICE


[Fig. 6: drawing.]

[Fig. 8: block diagram. Labelled block: DOCUMENT EPISODE RECOGNISER.]

[Fig. 9: block diagram. Labelled blocks: CONTROLLER; CONVERSATION RECOGNISER; COMPOSER (14).]



[Fig. 10(a): AUDIO SIGNALS FROM A TRUE CONVERSATION. Traces plotted against TIME for SPEAKER A and SPEAKER B.]

[Fig. 10(b): traces for SPEAKER A and SPEAKER B.]



[Fig. 11: block diagram. Labelled blocks: SPEECH RECOGNISER (78); PHONE CALL EPISODE RECOGNISER (80). Log entries shown at 84:]

14:17 JANE ROBERTS
14:56 PETER FRENCH
15:33 TOM SMITH
15:45 PATRICK JOHNSON



[Fig. 12: block diagram. Inputs: FILE ACTIVITY; PROC. FOCUS; PROGRAM INPUT. Labelled block: WORKSTATION ACTIVITY EPISODE RECOGNISER, with data memory (88).]



Europäisches Patentamt
(19) European Patent Office
Office européen des brevets

(11) Publication number : 0 495 622 A3

EUROPEAN PATENT APPLICATION

(21) Application number : 92300302.4
(22) Date of filing : 14.01.92

(51) Int. Cl.5 : G06F 15/401, G06F 15/403

(30) Priority : 14.01.91 GB 9100733

(43) Date of publication of application :
22.07.92 Bulletin 92/30

(84) Designated Contracting States :
DE FR GB

(88) Date of deferred publication of search report :
06.10.93 Bulletin 93/40

(71) Applicant : XEROX CORPORATION
Xerox Square
Rochester New York 14644 (US)

(72) Inventor : Lamming, Michael G.
179 Hills Road
Cambridge CB2 2RN (GB)
Inventor : Newman, William Maxwell
Yew Tree Cottage, Farnham
Blandford Forum, Dorset (GB)
Inventor : Wellner, Pierre
57 Carlyle Road
Cambridge CB4 3DH (GB)

(74) Representative : Goode, Ian Roy et al
Rank Xerox Patent Department Albion House
55 New Oxford Street
London WC1A 1BS (GB)

(54) Indexing of data sets.

(57) A system is disclosed for facilitating later
access to a portion of a datastream for retrieval
purposes based upon identifying "episodes" in
data from other sources related to a user's
activities. The stored data, in raw or "episodic"
form, are accessed as desired using parameters
based on recollections of the user's or database
interrogator's activities at the time of the event
which the user or interrogator wishes to access
or display. Such other sources may include
information about the movements, gatherings,
conversations, phone calls, etc. of users in a
particular environment fitted with the system.

Jouve, 18, rue Saint-Denis, 75001 PARIS



European Patent Office

EUROPEAN SEARCH REPORT

EP 92 30 0302
Page 1

DOCUMENTS CONSIDERED TO BE RELEVANT

Category  Citation of document with indication,         Relevant
          where appropriate, of relevant passages       to claim

X         US-A-6 479 631 (DINNAT R.)                    1,2
Y         * the whole document *                        3,4

Y         PATENT ABSTRACTS OF JAPAN                     3,4
          vol. 010, no. 256 (P-493) 2 September 1986
          & JP-A-61 082 287 (TOSHIBA ENG CO LTD)
          25 April 1986
          * abstract *

X         US-A-4 468 694 (EDGAR A.)                     5-8
          * the whole document *

X         EP-A-0 175 503 (BRITISH                       9,10,12,13
          TELECOMMUNICATIONS PLC)
          * the whole document *

X         PATENT ABSTRACTS OF JAPAN                     9,10,12,13
          vol. 012, no. 293 (E-645) 10 August 1988
          & JP-A-63 067 949 (FUJITSU LTD)
          26 March 1988
          * abstract *

X         PC TECH JOURNAL                               11,14,15
          vol. 6, no. 2, February 1988, USA
          pages 126 - 134
          SAWICKI E. 'The LAN Audit Trail'

X         IEEE PROCEEDINGS OF THE 6TH ANNUAL            11,14,15
          COMPUTER SECURITY APPLICATIONS CONFERENCE,
          CAT. NO. 90TH0251-7, 7 December 1990,
          TUCSON, AZ, USA
          pages 260 - 272
          MCAULIFFE ET AL. 'Is Your Computer Being
          Misused ? A Survey of Current Intrusion
          Detection System Technology'

CLASSIFICATION OF THE APPLICATION (Int. Cl.5): G06F15/401, G06F15/403
TECHNICAL FIELDS SEARCHED (Int. Cl.5): G06F

The present search report has been drawn up for all claims

Place of search: THE HAGUE
Date of completion of the search: 06 AUGUST 1993
Examiner: SUENDERMANN R.O.

CATEGORY OF CITED DOCUMENTS

X : particularly relevant if taken alone
Y : particularly relevant if combined with another
    document of the same category
A : technological background
O : non-written disclosure
P : intermediate document
T : theory or principle underlying the invention
E : earlier patent document, but published on, or
    after the filing date
D : document cited in the application
L : document cited for other reasons



EUROPEAN SEARCH REPORT

EP 92 30 0302
Page 2

DOCUMENTS CONSIDERED TO BE RELEVANT

GB-A-2 142 500 (UNITED KINGDOM ATOMIC
ENERGY AUTHORITY)
* the whole document *

US-A-4 884 132 (MORRIS ET AL.)
* the whole document *



